CDS

Accession Number TCMCG017C34395
gbkey CDS
Protein Id OMO52365.1
Location complement(join(79826..80023,80124..80192,80288..80335,80441..80536,80615..80725,80816..80900,81016..81128,81242..81307,81422..81541,81637..81684,81818..81889,81974..82036,82221..82292,82412..82557,82660..82879,83043..83099,83205..83233,83432..83489,83587..83754,83877..83943,84038..84138,84271..84384,84462..84533,84625..84708,84811..84930,85046..85220,85357..85445,85556..85639,85743..85949))
GeneID InterPro:IPR000602
Organism Corchorus olitorius
locus_tag COLO4_37248

Protein

Length 983aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA215141, BioSample:SAMN03160584
db_source AWUE01023868.1
Definition hypothetical protein COLO4_37248 [Corchorus olitorius]
Locus_tag COLO4_37248

EGGNOG-MAPPER Annotation

COG_category G
Description Lysosomal
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K01191        [VIEW IN KEGG]
EC 3.2.1.24        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00511        [VIEW IN KEGG]
map00511        [VIEW IN KEGG]
GOs GO:0000323        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005764        [VIEW IN EMBL-EBI]
GO:0005773        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGCGAGTGGGAGGTGGTGGATTTTGATTGCAATATTGTTGTGCATTTGCTGGTGGTGCGTGGAAGCCAAGTATATTGTTTACAATACGACGTCGAAGATAGTTCCTGGGAAACTTAATGTTCATTTGGTGGCTCATACGCATGATGATGTTGGTTGGTTGAAGACTGTGGATCAGTATTATGTCGGCTCCAATAATTCCATTCAGGGAGCATGTGTTCAGAATGTTTTGGATTCAATAGTGCCGGCACTTTTAGCAGATAAGAACCGGAAGTTCATATATGTTGAACAGGCATTTTTCCAGCGGTGGTGGAGAGATCAGAGTGAGGCAGTCCAAGAGACCGTGAAGAAGCTTATCAACTCGGGTCAACTAGAGTTAATAAATGGGGGCATGTGTATGCATGATGAGGCAGCCCCACATTACATTGATATGATAGATCAGACAACTCTTGGACACCGATTTATAAAACAAGAATTTAATGTGACCCCTAGGATTGGTTGGCAAATTGATCCCTTTGGACATTCTGCTGTGCAGGCTTACTTGCTGAGTGCAGAGGTTGGATTTGATTCACTTTTCTTTGGGCGAATCGACTACCAAGATAGAGCAAAAAGGAAAGATGACAAGAGCCTTGAAGTTGTATGGCGCGGCTCTAAGAGTCTCGGTTCATCAGCACAGATATTTGCTGGTGCATTCCCTCAGAATTATGAACCTCCCAGCAATTTCTACTATGAAGTTAATGATGATTCCCCAATTGTCCAGGATAACATGGAGTTGTTTGACTACAATGTTCCTGAGCGTGTGAATGAGTTTGTAGCAGCTGCATTATCTCAGGCTAACATAACTCGCACAAACCATGTAATGTGGACTATGGGAACAGATTTCAAGTATCAATATGCACACACATGGTTCCGGCAAATGGACAAGTTCATTCATTATGTTAATCAAGATGGGCGTGTCAATGCCCTGTATTCAACCCCATCGATATATACTGATGCCAAATATGCAGCAAATGAGGCCTGGCCACTCAAGACTGATGACTACTTTCCATATGCGGATGAAATAAACGCCTACTGGACTGGATATTTTACAAGCAGGGCAGCACTCAAAGGTTATGCAGCAAGGCAATTAGAGTTTTTCATGGGACGAAGTAAAGTGGGACCTAACACTGACTATTTAGGTGATGCTCTAGCTCTTGCTCAGCATCATGATGCAGTCAGCGGTACTTCAAAACAGCATGTGGCTAATGATTATGCCAAAAGGCTAGCCATTGGCTACGAGCAGGCTGCAAAGGTGGTTCAGACATCACTAGCCAGTTTAACAAGGTCCTCTTCAAAGACGGAGGTTGATCTGTCAAATGGGAAAAATCTGGTGGTTGTTGTCTACAATCCCTTGGGATGGAAAAGAGATGACATAATAAGAATTCCTGTTCTTGACGAGAATGTCATTGTTAAGGATTCTAGTGGAAAAGAAATTGAATCACAGCTTCTACCTCTGCAAAATGCATCTTTGGCCATAAGAAACTACTATTCTGTCGCTTATTCGGGTAAATCTCCAAGTGTTACCCCAAAGTATTGGCTTGCATTTTCAGCATCTGCACCGCCTATTGGTTTCAACACTTACTTCATCTCAAGAGGCAAACGACCAGCTGCAGCTACTATTTCAAAGAGCCAGACAGTCTACAGTTCTGAAGAAAAACAAAATGATGCCATTGAAATAGGGCCAGGAGACCTAAAACTAGTGTATTCTAGAAAACAACAAAAACTGGTTCGCTATATTAATAGCAGAACTAAGGTTGAAGAATCTGTAGGTCAATCATATACCTACTATTCTGGATATGATGGAAGTTTGGTAAACGATTCACAGGCCTCTGGAGCATACATCTTTCGTCCGAATCTCACTTATCCTATCACATCATCTGATGGCCAGGCTTCTTTTACCGTTTTGCGCGGACCATTGTTGGATGAAGTACACCAGAGAATCAATTCGTGGATATATCAGACCACAAGAGTGTACAAAGGAAAGGAGCATGCTGAAGTTGAGTTCACTGTTGGGCCTATTCCTATTGATGATGGAATTGGGAAAGAAGTTGTGACTCAGATTTCAACATATTTGAAAACCAACAAAACTTTCTACACAGATTCCAGTGGTCGCGATTTCATCGAAAGGATTCGAGACTATAGAAAAGACTGGAACCTACAAGTGAATCAACCTATTGCTGGAAACTATTACCCTATCAATCTTGGAATTTACAGTAAAGATGATAGCAAGGAGCTTTCAATCTTAGTCGACCGATCTGTAGGTGGATCCAGCATTAAGGATGGTCAATTGGAACTAATGCTTCATAGGAGGTTGCTTTATGATGATGCTAGAGGTGTTGGAGAAGCTCTAAATGAAACAGTTTGTGTTCAAAATAAATGCACAGGACTAACTGTTGTGGGGAAGTATTACCTTAGAATAGATCCTCTCGGAGAGGCAGCTAAGTGGCGTCGATCATTTGGTCAGGAGATCTATTCTCCATTCCTCTTAGCCTTTACGGAGCAGGATGGAAATGGATGGAAGAATTCCAATGTATTATCCTTTACAGGAATGGACCCTTCCTATACTTTACCTGATAATATTGCAATGATTACCCTCCAGGAATTGGACAATGGAAAAGTTCTTCTTCGGTTGGCACACTTATATGAGGTTGGAGAGGACAATGATCTTTCAGTCATGGCAAGTGTAGAATTAAGAAAAGTATTTGCACATAAGAAGATTAACAAAGTAACAGAAATGAATCTATCTGCTAACCAAGGAAGAACAGAAATGGAAAAGAAGAGACTTGTATGGAAAACCGAAGGCTCTTCGGGAGAGTCTCCGAAGGCGGTCAGGGGAGGACCTGTTGATCCTGCAGCTTTGGTGGTTCAACTTGCTCCAATGGAAATCCGAACCTTTGTAATTGACTTGTATTAG
Protein:  
MASGRWWILIAILLCICWWCVEAKYIVYNTTSKIVPGKLNVHLVAHTHDDVGWLKTVDQYYVGSNNSIQGACVQNVLDSIVPALLADKNRKFIYVEQAFFQRWWRDQSEAVQETVKKLINSGQLELINGGMCMHDEAAPHYIDMIDQTTLGHRFIKQEFNVTPRIGWQIDPFGHSAVQAYLLSAEVGFDSLFFGRIDYQDRAKRKDDKSLEVVWRGSKSLGSSAQIFAGAFPQNYEPPSNFYYEVNDDSPIVQDNMELFDYNVPERVNEFVAAALSQANITRTNHVMWTMGTDFKYQYAHTWFRQMDKFIHYVNQDGRVNALYSTPSIYTDAKYAANEAWPLKTDDYFPYADEINAYWTGYFTSRAALKGYAARQLEFFMGRSKVGPNTDYLGDALALAQHHDAVSGTSKQHVANDYAKRLAIGYEQAAKVVQTSLASLTRSSSKTEVDLSNGKNLVVVVYNPLGWKRDDIIRIPVLDENVIVKDSSGKEIESQLLPLQNASLAIRNYYSVAYSGKSPSVTPKYWLAFSASAPPIGFNTYFISRGKRPAAATISKSQTVYSSEEKQNDAIEIGPGDLKLVYSRKQQKLVRYINSRTKVEESVGQSYTYYSGYDGSLVNDSQASGAYIFRPNLTYPITSSDGQASFTVLRGPLLDEVHQRINSWIYQTTRVYKGKEHAEVEFTVGPIPIDDGIGKEVVTQISTYLKTNKTFYTDSSGRDFIERIRDYRKDWNLQVNQPIAGNYYPINLGIYSKDDSKELSILVDRSVGGSSIKDGQLELMLHRRLLYDDARGVGEALNETVCVQNKCTGLTVVGKYYLRIDPLGEAAKWRRSFGQEIYSPFLLAFTEQDGNGWKNSNVLSFTGMDPSYTLPDNIAMITLQELDNGKVLLRLAHLYEVGEDNDLSVMASVELRKVFAHKKINKVTEMNLSANQGRTEMEKKRLVWKTEGSSGESPKAVRGGPVDPAALVVQLAPMEIRTFVIDLY